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3 <110> APPLICANT: Pompejus, Markus 



4 Seulberger, Harald 

5 Hoeffken, Hans Wolfgang 

6 Doval, Jose Luis Revuelta 

7 Jimenez, Alberto 

8 Garcia, Maria Angeles Santos 



10 <120> TITLE OF INVENTION: Phosphoriboxyl- Pyrophosphate Synthetase Polypeptide 
12 <130> FILE REFERENCE: PF48687-2/DP 

14 <140> CURRENT APPLICATION NUMBER: US 10/076, 157A 

15 <141> CURRENT FILING DATE: 2002-02-15 

17 <150> PRIOR APPLICATION NUMBER: Germany, 19757755.5 

18 <151> PRIOR FILING DATE: 1997-12-23 
20 <160> NUMBER OF SEQ ID NOS : 21 

22 <170> SOFTWARE: WordPerfect 8 

24 <210> SEQ ID NO: 1 

25 <211> LENGTH: 1911 

26 <212> TYPE: DNA 

27 <213> ORGANISM: Ashbya gossypii 

2 9 <4 00> SEQUENCE: 1 

C--> 31 ggtagtcgct catcgacaga cacaatcgcg tgttctctct gaatcgtcca ttgggtgtca 60 

33 gcatcctgat cgcgggcgga tggaatgggt aatcattagg aaacaccaat gtcccatggt 120 
35 attgtccgtc ctcgtatggt gtctcaggag gacccgtgat cacgtagtgc cacaccagga 180 

3 7 tattgtcttc ctttggtgct gccacgatgt agggcggggg gttctcggtc atcattttgt 240 
39 actcctttga gagccgcttg tacgcctgtc ttgatgccat cttgcctact attagtttct 300 
41 caccacttcc cgccaaacaa tctgcacttt acgagcgcta tctatccctc gggtcgctct 360 
43 agttgattat tggcgaaact gatagttcag gtacttccat gatgcggtca tatccacgta 420 
45 tgtgatcacg tgatcatcag ccatgctgcc agctcacggg cctgcctaca ctattggagg 480 
47 ctctgtgagt catgatttat tgcatatcaa gcccagatag tcgttgggga tactaccgtt 540 
49 gccgcgatga gctccgatat taagttgtag ccaaaaattt taacggatga cttcttaaca 600 
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69 60 65 70 

71 ctg etc ate atg ate eat gee tge egg tea gee tet gcg egg aag ate 8 92 

72 Leu Leu lie Met lie His Ala Cys Arg Ser Ala Ser Ala Arg Lys lie 

73 75 80 85 

75 aca gcg gtt ata eea aac tte cet tac gea aga caa gac aaa aag gae 940 

76 Thr Ala Val lie Pro Asn Phe Pro Tyr Ala Arg Gin Asp Lys Lys Asp 

77 90 95 100 105 

79 aag teg cga gca ccg ata act gcc aag ctg gtg gcc aag atg eta gag 988 

80 Lys Ser Arg Ala Pro lie Thr Ala Lys Leu Val Ala Lys Met Leu Glu 

81 110 115 120 

83 ace gcg ggg tge aac cac gtt ate acg atg gat ttg cac gcg tet caa 1036 

84 Thr Ala Gly Cys Asn His Val lie Thr Met Asp Leu His Ala Ser Gin 

85 125 130 135 

87 att cag ggt ttc ttc cac att eca gtg gac aac eta tat gca gag ccg 1084 

88 He Gin Gly Phe Phe His He Pro Val Asp Asn Leu Tyr Ala Glu Pro 

89 140 145 150 

91 aae ate ctg cac tac ate caa eat aat gtg gac ttc eag aat agt atg 1132 

92 Asn He Leu His Tyr He Gin His Asn Val Asp Phe Gin Asn Ser Met 

93 155 160 165 

95 ttg gte gcg eca gae gcg ggg teg gcg aag cgc acg teg acg ett teg 1180 

96 Leu Val Ala Pro Asp Ala Gly Ser Ala Lys Arg Thr Ser Thr Leu Ser 

97 170 175 180 185 

99 gae aag ctg aat etc aac ttc gcg ttg ate cac aaa gaa egg eag aag 1228 

100 Asp Lys Leu Asn Leu Asn Phe Ala Leu He His Lys Glu Arg Gin Lys 

101 190 195 200 

103 geg aae gag gte teg egg atg gtg ttg gtg ggt gat gtc gcc gac aag 1276 

104 Ala Asn Glu Val Ser Arg Met Val Leu Val Gly Asp Val Ala Asp Lys 

105 205 210 215 

107 tec tgt att att gta gae gac atg gcg gac aeg tge gga aeg eta gtg 1324 

108 Ser Cys He He Val Asp Asp Met Ala Asp Thr Cys Gly Thr Leu Val 

109 220 225 230 

111 aag gcc act gac acg ctg ate gaa aat tgt gcg aaa gaa gtg att gee 1372 

112 Lys Ala Thr Asp Thr Leu He Glu Asn Cys Ala Lys Glu Val He Ala 

113 235 240 245 

115 att gtg aca cac ggt ata ttt tet ggc ggc gcc cgc gag aag ttg cgc 1420 

116 He Val Thr His Gly He Phe Ser Gly Gly Ala Arg Glu Lys Leu Arg 

117 250 255 260 265 

119 aac age aag ctg gca egg ate gta age aca aat acg gtg eca gtg gac 1468 
12 0 Asn Ser Lys Leu Ala Arg He Val Ser Thr Asn Thr Val Pro Val Asp 
121 270 275 280 

123 etc aat eta gat ate tac cac caa att gae att agt gee att ttg gcc 1516 

124 Leu Asn Leu Asp He Tyr His Gin He Asp He Ser Ala He Leu Ala 

125 285 290 295 

127 gag gca att aga agg ett cac aac ggg gaa agt gtg teg tac ctg tte 1564 
12 8 Glu Ala He Arg Arg Leu His Asn Gly Glu Ser Val Ser Tyr Leu Phe 

129 300 305 310 

131 aat aae get gtc atg tagtgctgtc agtggcagat gcatgatcgc tggectaatt 1619 

132 Asn Asn Ala Val Met 

133 315 
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135 atctgtgtaa gttgatacaa tgcagtaaat acagtacata aaactgaatg tttttcactt 
137 aggggtgctt tgttgttctg atagcgtgtg tgcgaatttg gaggtgaaag ttgaacatca 
139 cgtaatgaat acaaacaaga ttgcacatta ggaaaagcga taaattattt attatttgca 
141 actggccttt gagcgtttaa gcctgaacat ttttgccctt ttgtttgacc gtaccgttat 
143 cactcgtcct tatatatggc tatccttctc ttccggaact tcttcgagcg ta 

146 <210> SEQ ID NO: 2 

147 <211> LENGTH: 318 

148 <212> TYPE: PRT 

149 <213> ORGANISM: Ashbya gossypii 
151 <400> SEQUENCE: 2 

153 Met Ser Ser Asn Ser lie Lys Leu Leu Ala Gly Asn Ser His Pro Asp 

154 15 10 15 

156 Leu Ala Glu Lys Val Ser Val Arg Leu Gly Val Pro Leu Ser Lys lie 

157 20 25 30 

159 Gly Val Tyr His Tyr Ser Asn Lys Glu Thr Ser Val Thr He Gly Glu 

160 35 40 45 

162 Ser He Arg Asp Glu Asp Val Tyr He He Gin Thr Gly Thr Gly Glu 

163 50 55 60 

165 Gin Glu He Asn Asp Phe Leu Met Glu Leu Leu He Met He His Ala 

166 65 70 75 80 

168 Cys Arg Ser Ala Ser Ala Arg Lys He Thr Ala Val He Pro Asn Phe 

169 85 90 95 

171 Pro Tyr Ala Arg Gin Asp Lys Lys Asp Lys Ser Arg Ala Pro He Thr 

172 100 105 110 

174 Ala Lys Leu Val Ala Lys Met Leu Glu Thr Ala Gly Cys Asn His Val 

175 115 120 125 

177 He Thr Met Asp Leu His Ala Ser Gin He Gin Gly Phe Phe His He 

178 130 135 140 

180 Pro Val Asp Asn Leu Tyr Ala Glu Pro Asn He Leu His Tyr He Gin 

181 145 150 155 160 

183 His Asn Val Asp Phe Gin Asn Ser Met Leu Val Ala Pro Asp Ala Gly 

184 165 170 175 

186 Ser Ala Lys Arg Thr Ser Thr Leu Ser Asp Lys Leu Asn Leu Asn Phe 

187 180 185 190 

18 9 Ala Leu He His Lys Glu Arg Gin Lys Ala Asn Glu Val Ser Arg Met 
190 195 200 205 

192 Val Leu Val Gly Asp Val Ala Asp Lys Ser Cys He He Val Asp Asp 

193 210 215 220 

195 Met Ala Asp Thr Cys Gly Thr Leu Val Lys Ala Thr Asp Thr Leu He 

196 225 230 235 240 

198 Glu Asn Cys Ala Lys Glu Val He Ala He Val Thr His Gly He Phe 

199 245 250 255 

2 01 Ser Gly Gly Ala Arg Glu Lys Leu Arg Asn Ser Lys Leu Ala Arg He 
202 260 265 270 

204 Val Ser Thr Asn Thr Val Pro Val Asp Leu Asn Leu Asp He Tyr His 

205 275 280 285 

207 Gin He Asp He Ser Ala He Leu Ala Glu Ala He Arg Arg Leu His 

208 290 295 300 

210 Asn Gly Glu Ser Val Ser Tyr Leu Phe Asn Asn Ala Val Met 



1679 
1739 
1799 
1859 
1911 
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211 305 



310 



315 



214 <210> SEQ ID NO: 3 

215 <211> LENGTH: 5369 

216 <212> TYPE: DNA 

217 <213> ORGANISM: Ashbya gossypii 
219 <400> SEQUENCE: 3 

C--> 222 aagcttgacc ttggctggca cttgagtcgg cagacaggtg gactaacccg agca atg 57 

223 Met 

224 1 

226 gat cgt ggt tgt aaa ggt ate tct tat gtg etc agt gca atg gtt ttt 105 

22 7 Asp Arg Gly Cys Lys Gly lie Ser Tyr Val Leu Ser Ala Met Val Phe 
228 5 10 15 

230 cac ata ata ccg att aca ttt gaa ata teg atg gta tgt ggc ata ttg 153 

231 His He He Pro He Thr Phe Glu He Ser Met Val Cys Gly He Leu 

232 20 25 30 

234 aca tac cag ttt ggt get tec ttc get get ata aca ttc teg act atg 201 

235 Thr Tyr Gin Phe Gly Ala Ser Phe Ala Ala He Thr Phe Ser Thr Met 

236 35 40 45 

23 8 ctt ctt tac tec ate ttt act ttc aga acg aeg geg tgg egc aca egg 249 
23 9 Leu Leu Tyr Ser He Phe Thr Phe Arg Thr Thr Ala Trp Arg Thr Arg 

240 50 55 60 65 

242 ttt agg egt gat gcg aac aag get gae aat aag gee get agt gtg gca 2 97 

243 Phe Arg Arg Asp Ala Asn Lys Ala Asp Asn Lys Ala Ala Ser Val Ala 

244 70 75 80 

246 ttg gat tee eta ata aat ttt gaa get gta aag tat tte aat aac gag 345 

247 Leu Asp Ser Leu He Asn Phe Glu Ala Val Lys Tyr Phe Asn Asn Glu 

248 85 90 95 

250 aag tac ctt geg gae aag tat eae aea tee ttg atg aag tac egg gat 3 93 

251 Lys Tyr Leu Ala Asp Lys Tyr His Thr Ser Leu Met Lys Tyr Arg Asp 

252 100 105 110 

254 tee eag ata aag gte teg eaa teg etg geg ttt ttg aac aee gge eag 441 

255 Ser Gin He Lys Val Ser Gin Ser Leu Ala Phe Leu Asn Thr Gly Gin 

256 115 120 125 

258 aae eta att ttt aee act gca etg act gea atg atg tat atg gcc tgt 489 
25 9 Asn Leu He Phe Thr Thr Ala Leu Thr Ala Met Met Tyr Met Ala Cys 
260 130 135 140 145 

262 aat ggt gtt atg cag ggc tct ctt aca gtg ggg gat ctt gtg tta att 537 

263 Asn Gly Val Met Gin Gly Ser Leu Thr Val Gly Asp Leu Val Leu He 

264 150 155 160 

266 aat eaa etg gta tte eag etc tec gtg cea eta aac ttc ctt ggt age 585 

267 Asn Gin Leu Val Phe Gin Leu Ser Val Pro Leu Asn Phe Leu Gly Ser 

268 165 170 175 

270 gtc tac cgt gat etc aag cag tct etg ata gat atg gaa tct tta ttt 633 

271 Val Tyr Arg Asp Leu Lys Gin Ser Leu He Asp Met Glu Ser Leu Phe 

272 180 185 190 

274 aaa etg eaa aaa aat cag gte aca att aag aac tec cea aat gcc cag 681 

275 Lys Leu Gin Lys Asn Gin Val Thr He Lys Asn Ser Pro Asn Ala Gin 

276 195 200 205 

2 78 aac eta cea ata eae aaa ccg ttg gat att cgc ttt gaa aat gtt aeg 72 9 
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279 Asn Leu Pro lie His Lys Pro Leu Asp lie Arg Phe Glu Asn Val Thr 

280 210 215 220 225 

282 ttt ggc tat gac ccg gag egg cgt ata ttg aac aat gtt teg ttt acc 777 
2 83 Phe Gly Tyr Asp Pro Glu Arg Arg He Leu Asn Asn Val Ser Phe Thr 
284 230 235 240 

286 ate cca get gga atg aag aet gee ata gta gge cea teg ggc teg ggg 825 

287 He Pro Ala Gly Met Lys Thr Ala He Val Gly Pro Ser Gly Ser Gly 

288 245 250 255 

2 90 aag tee aee att ttg aag etc gta ttt aga tte tat gag ccc gag caa 873 

291 Lys Ser Thr He Leu Lys Leu Val Phe Arg Phe Tyr Glu Pro Glu Gin 

292 260 265 270 

2 94 ggt egt ate eta gtt gge gge aca gat ate ege gat tta gae ttg ett 921 

295 Gly Arg He Leu Val Gly Gly Thr Asp He Arg Asp Leu Asp Leu Leu 

296 275 280 285 

298 tct tta egg aag get ate ggt gte gtg eee eaa gat aet cct ete tte 969 

299 Ser Leu Arg Lys Ala He Gly Val Val Pro Gin Asp Thr Pro Leu Phe 

300 290 295 300 305 

302 aat gae aca ate tgg gag aat gtt aaa tte ggc aat ate agt tec tct 1017 

303 Asn Asp Thr He Trp Glu Asn Val Lys Phe Gly Asn He Ser Ser Ser 

304 310 315 320 

306 gae gat gag att etc agg gee ata gaa aaa get caa etc aeg aag eta 1065 

3 07 Asp Asp Glu He Leu Arg Ala He Glu Lys Ala Gin Leu Thr Lys Leu 
308 325 330 335 

310 ete eag aac eta cea aag gge get tee aee gtt gta ggg gag cgc ggt 1113 

311 Leu Gin Asn Leu Pro Lys Gly Ala Ser Thr Val Val Gly Glu Arg Gly 

312 340 345 350 

314 ttg atg ate age gga ggt gag aaa caa agg ett get att get cgt gtg 1161 

315 Leu Met He Ser Gly Gly Glu Lys Gin Arg Leu Ala He Ala Arg Val 

316 355 360 365 

318 ett ttg aag gae get eeg etg atg ttt tte gae gag get aca agt get 1209 

319 Leu Leu Lys Asp Ala Pro Leu Met Phe Phe Asp Glu Ala Thr Ser Ala 

320 370 375 380 385 

322 etg gat aca cac aca gag cag gca etc ttg cac acc att cag cag aac 1257 

323 Leu Asp Thr His Thr Glu Gin Ala Leu Leu His Thr He Gin Gin Asn 

324 390 395 400 

326 ttt tct tee aat tea aag aeg age gtt tac gtt gee eat aga etg ege 1305 

32 7 Phe Ser Ser Asn Ser Lys Thr Ser Val Tyr Val Ala His Arg Leu Arg 
328 405 410 415 

330 aca ate get gat gea gat aag ate att gtt ett gaa eaa ggt tet gte 1353 

331 Thr He Ala Asp Ala Asp Lys He He Val Leu Glu Gin Gly Ser Val 

332 420 425 430 

334 ege gaa gag ggc aca cac age teg etg tta geg tea eaa gga tec eta 1401 

33 5 Arg Glu Glu Gly Thr His Ser Ser Leu Leu Ala Ser Gin Gly Ser Leu 
336 435 440 445 

33 8 tac egg ggt etg tgg gat att cag gaa aac eta aeg ett ccg gaa egg 1449 
33 9 Tyr Arg Gly Leu Trp Asp He Gin Glu Asn Leu Thr Leu Pro Glu Arg 
340 450 455 460 465 

342 cct gag cag tea acc gga tct cag cat gca tagacgtctg actagagatt 1499 

343 Pro Glu Gin Ser Thr Gly Ser Gin His Ala 
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VERIFICATION SUMMARY DATE: 10/22/2 004 

PATENT APPLICATION: US/10/076 , 157A TIME: 12:20:08 

Input Set : As\076157.txt 

Output Set: N:\CRF4\10222004\J076157A.raw 

L:31 M:112 C: (48) String data converted to lower case, 
M:112 Repeated in SeqNo=l 

L:222 M:112 C: (48) String data converted to lower case, 
M:112 Repeated in SeqNo=3 

L:356 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID: 3 
L:496 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID: 3 
L:500 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:3 
L:504 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID: 3 
L:508 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:3 
L:512 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:3 
L:516 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID: 3 
L:520 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:3 
L:524 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:3 
L:528 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:3 
L:532 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:3 
L:536 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:3 
L:540 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:3 
L:544 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:3 
L:548 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID: 3 
L:552 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:3 
L:556 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:3 
L:560 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:3 
L:564 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID: 3 
L:568 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:3 
L:572 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:3 
L:576 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID: 3 
L:580 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:3 
L:584 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID: 3 
L:588 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:3 
L:902 M:112 C: (48) String data converted to lower case, 
M:112 Repeated in SeqNo=7 

hi 976 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID: 7 
L:1229 M:112 C: (48) String data converted to lower case, 
M:112 Repeated in SeqNo=10 

L:1515 M:112 C: (48) String data converted to lower case, 
M:112 Repeated in SeqNo=12 
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